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problems on graphs, defined as instances in which the minimum or average 
degree is Q(n/iJ)(ri)) for some function ip(n) = w(l) of the instance size. 
We design new approximation algorithms as well as new polynomial time 
approximation schemes (PTASs) for those problems and establish first 
approximation hardness results for them. Interestingly, in some cases we 
were able to prove optimality of the underlying approximation ratios, under 
usual complexity-theoretic assumptions. Our results for the Vertex Cover 
^ | problem depend on an improved recursive sampling method which could be 

>• ' of independent interest. 
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1 Introduction 

•i-H . 

In order to cope with the intractability of combinatorial optimization problems, 
it is often useful to consider specific families of instances, the properties of 
which can be exploited to obtain polynomial-time algorithms with improved 
approximation guarantees. Dense instances of optimization problems on graphs, 
in which the minimum or the average degree of the input graph is high, are an 
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example of such a family. Typically, an instance of a combinatorial optimization 
on a graph is said to be dense whenever the minimum or average degree of the 
graph is fi(n), where n is the number of vertices. It was proven in Arora, Karger 
and Karpinski [ 1 ] that a wide range of maximization problems restricted to dense 
instances admit polynomial time approximation schemes (PTAS) . Later, Karpinski 
and Zelikovsky Il22l defined and studied dense cases of covering problems. In 
particular, they investigated the approximation complexity of dense versions of 
Set Cover, Vertex Cover and the Steiner Tree problem. Density parameters have 
been used in approximation ratios for various optimization problems (see 1121 1 for 
a detailed survey and H22l [T8l [8]] for the Vertex Cover problem) . 

In this paper, we aim at bridging the gap between dense and sparse instances, 
by considering wider ranges for the density parameters. We define families of 
subdense instances, in which the minimum (or average) degree of the underly- 
ing graph is o(n), but Q(n/ip(n)) for some function ip of the instance size n. We 
consider subdense cases of covering and Steiner Tree problems, for which previous 
results have been obtained in dense cases. We now describe precisely the problems 
and our contributions. 



1.1 Problems and Previous Results 

Vertex Cover. The Vertex Cover (VC) problem asks to find for a given graph G = 
(V, E) a minimum size vertex set C C V which covers the edges of G. We call a 
graph G = (V, E) (n = |V|) everywhere e-dense for e > if all vertices in G have at 
least en neighbors, and average e-dense if the average degree of a vertex in G is at 
least en (that is, the number of edges is a constant fraction of that of the complete 
graph). Currently, the best parameterized ratios for the Vertex Cover problem 
for average and everywhere dense instances are 2/(2 — y/1 — e) and 2/(1 + e), 
respectively 11221 . Imamura and Iwama [18] later improved the former result, by 
generalizing it to depend on both average degree d and the maximum degree A. 
In particular, they gave an approximation algorithm with an approximation factor 
smaller than 2, provided that the ratio d/A>c for some constant c < 1, and 



As for lower bounds, Clementi and Trevisan llTOll as well as Karpinski and 
Zelikovsky [22J proved that the Vertex Cover problem restricted to everywhere 
and average dense graphs remains APX-hard. Later, Eremeev lfi"2l showed that 
it is NP-hard to approximate the Vertex Cover problem in everywhere e-dense 
graphs within a factor less than (7 + e)/(6 + 2e). Finally, Bar-Yehuda and Kehat H2l 
prove that if the Vertex Cover problem cannot be approximated within a factor 
strictly smaller than 2 on arbitrary graphs, then it cannot be approximated within 
factors smaller than 2/(2 — y/1 — e) — o(l) and 2/(1 + e) — o(l), respectively, on 
average and everywhere dense graphs. Imamura and Iwama lfT8l later proved 
that their obtained approximation bound is best possible unless the general vertex 
cover problem can be approximated within 2 — e for some e > 0. 
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Connected Vertex Cover. The Connected Vertex Cover (CVC) problem is a variant 
of the VC problem, the goal of which is to cover the edges of a given connected 
graph with a minimum size vertex set that induce a connected subgraph. Cardinal 
and Levy fl8]] gave the first parameterized approximation ratios for the CVC 
problem. The achieved ratios are 2/(2 — y/1 — e) for the average dense case 
and 2/(1 + e) for the everywhere dense case. Assuming the Unique Game 
Conjecture If2~4ll , the reduction in fl2l] implies that the achieved ratios are the best 
possible. Cardinal and Levy conjectured that similar results due to Imamura and 
Iwama can be obtained in subdense cases and left it for future work. 



Set Cover. A Set Cover instance consists of a ground set X of size n and a collec- 
tion P c 2 X of m subsets of X. The goal is to find a minimum size subset of P that 
covers X, that is, a minimum cardinality subset M C P such that \J xeM x = X. 
The simple greedy heuristic for the Set Cover problem was already studied by 
Johnson |fT9ll and Lovasz II25II . They showed independently that this approxima- 
tion algorithm provides a (1 + In (n)) -approximate solution which is essentially the 
best achievable under reasonable complexity-theoretic assumptions lfT3l1 . 

Bar-Yehuda and Kehat [Ell studied dense versions of the Set Cover problem. 
Their definition of a dense instance generalizes that of the dense Vertex Cover 
problem; they study /^-bounded instances, in which every element of X appears in 
at most k subsets of P (for the Vertex Cover problem, we have k = 2), but where 
n > em k . They generalized the results of Karpinski and Zelikovsky [22 J to this 
setting. 

On the other hand, Karpinski and Zelikovsky Il22l defined e-dense instances 
of the Set Cover problem as instances in which every element of X belongs to 
a fraction at least e of the subsets of P. They showed that such dense instances 
can be approximated within cm(n) for every c > in polynomial time and can be 
solved to optimality in time m°W' 1 ". 



Steiner Tree. In the Steiner Tree problem (STP), we are given a connected graph 
G = (V, E) and a set of distinguished vertices S C V, called terminals. We want 
to find a minimum size tree within G that spans all terminals from S. The Steiner 
Tree problem is one of the first problems shown to be NP-hard by Karp [ 20 ] . A long 
sequence of papers give approximation factors better than 2 ||2~7l O [23l [T71 [26l [7D . 
The current best approximation ratio is 1.39 due to Byrka et al. [7]. Since the 
problem is APX-hard lloll . we do not expect to obtain a PTAS for this problem in 
arbitrary graphs. In particular, it is NP-hard to find solutions of cost less than || 
time of the optimal cost Q9J] . 

Karpinski and Zelikovsky [22tl introduced the e-dense Steiner Tree problem. A 
Steiner Tree is called e-dense if each terminal t e S has at least e| V\ 1 S f | neighbors 
in V\S. They showed that for every e > the e-dense Steiner Tree problem admits 
a PTAS. Later on, Hauptmann IfToll showed that the same scheme even yields an 
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efficient PTAS for the e-dense Steiner Tree problem. 



1 . 2 Our Contributions 

In this work, we consider a natural extension of the e-density: an instance is called 
ip(n) -dense, if the minimum (or average) degree is at least In the remainder, 
we require ip(n) to be and n is a size parameter of the instance. This type 
of instances are called nondense and were considered before for the MAX-CUT 
problem by Fernandez de la Vega and Karpinski |[T4l . 

In Section [2l we show how to modify the sampling procedure of the Imamura- 
Iwama algorithm (II-algorithm) IfTBlO so that the degree condition can be slightly 
relaxed. This yields the same approximation factors in subdense instances where 
ip(n) is slightly sublogarithmic. More precisely, given a graph G with average 
degree d, maximum degree A < | and A = Vt{n/ip(n)), we can compute with high 
probability a solution with approximation ratio 1+d/ 2 (2A) in n o(i) 2 o«>(n)iogiogiog(n)) 

time which is polynomial if ip(n) = 0(log(n)/logloglog(n)). From the previously 
known hardness result, the approximation ratio is best possible unless Vertex 
Cover can be approximated within 2 — e for some e > 0. 

In Section [3l we introduce the Subset Connected Vertex Cover (SCVC) problem 
which generalizes the CVC problem and present a constant-factor approximation 
algorithm for the problem. This algorithm relies on a previously known approx- 
imation algorithm for quasi-bipartite instances of the Steiner Tree problem. By 
combining the approximation algorithm for the SCVC problem with the modified 
II-algorithm, we prove the existence of a randomized approximation algorithm 
with the same approximation factor and running time as for Vertex Cover in non- 
dense instances. In particular, given a graph G with average degree d, maximum 
degree A < | and A = Q(n/ip(n)), we can compute with high probability a 
connected vertex cover with approximation ratio 1+ ^ 2A ^ in n ^2 ^ n ^ l ° sioslos ^ 
time which is polynomial if ip(n) = 0(\og(n)/ logloglog(n)) and quasipolynomial 
if ip(ri) = polylog(n). 

In Section 01 we study ip(n)-dense instances of the Set Cover problem, in 
which every element of the ground set X is contained in at least \P\/ip(n) subsets 
of the family P. In particular, we analyze the performance of the greedy heuristic 
and show that the greedy algorithm for ip(n) -dense instances of the Set Cover 
problem returns a solution of size at most ip(n)\nn. Consequently, it follows 
that ip(n) -dense instances can be solved exactly in time 0(|P| ln(n ^ n )) and unless 
NP C DTIME [n log "■'/'M] 3 the ^(^)-dense Set Cover problem is not NP-complete. 
This algorithm is used as a subroutine for the approximation algorithm for the 
Steiner Tree problem. 

The subdense Steiner Tree problem is tackled in Section [5l An instance of 
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the Steiner Tree problem is said to be ip{n) -dense when every terminal in S is 
connected to at least \V \ S\/ip(n) nonterminals. We prove that these instances 
can be approximated within 1 + § in time n 0<1 )2 (^) . In particular, it yields 
a PTAS for the cases where ip{n) = O(logn), and a QPTAS for the cases where 
ip(n) = polylogn. On the negative side, we show that for every 5, e > 0, the 
Steiner Tree problem restricted to \V \ S| -^-everywhere dense graphs is NP-hard 
to approximate with §§§—£. Our results are listed in the Tables [1] and El (We 
assume tp(n) > 2.) 



Upper Bounds 


VC 


CVC 


SC 


STP 


Subdense: ip(n) = O(logn) 


V>(n) = 0(Efe) 




ip(n) = O(logn) 


if)(n) = O(logn) 


2 -approx. 

±+ 2A 


2 -approx. 

±i_ 2A 


exact 


PTAS 


poly 
time 


poly 
time 


(m°^ 2n ^ 
time 




Mildly Sparse: ifj(n) = polylogn 


same as in |fT8|| 


2 -approx. 

±+ 2A 


exact 


QPTAS 




quasipoly 
time 


O(m polylogri ) 
time 




Nondense: tp(n) = co(l) 


2 -approx. 

±+ 2A 


-V-approx. 

±+ 2A 


exact 


(1 + 5) -approx. 


0(l)20(i>(n) In In Inn) 

time 


0(l)20(f(n) In In Inn) 

time 


time 


n O(l) 2 0(*M) 

time 



Table 1 : Overview Upper Bounds 



Lower Bounds 


VC 


CVC 


SC 


STP 


V5 > 
ijj( n ) = 0{n l - & ) 


V5 > 




V5 > 
iP(n) = \V\S\ 1 - 5 


Ve > 
UGC-hard with 

\ e 

(cf. ffH) 


Ve > 
UGC-hard with 

2 e 

(cf. mi) 




Ve > 
NP-hard with 

263 

262 fc 



Table 2: Overview Lower Bounds 
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2 The Subdense Vertex Cover Problem 



Previous hardness results in Imamura and Iwama IfTBII and Karpinski and Ze- 
likovsky Il22ll imply that approximating subdense instances - in which either the 
average or the minimum degree is o{n) - within a ratio strictly smaller than 2 is as 
hard as approximating the general minimum Vertex Cover problem within a ratio 
smaller than 2. 

However, if we make the hypothesis that the average and maximum degree, 
denoted respectively by d and A, are of the same order, we can obtain better 
approximation results in the subdense cases. Imamura and Iwama ITT811 gave an 
approximation algorithm with an approximation factor smaller than 2 for vertex 
cover, provided that the ratio d/A > c for some constant c < 1, and both are 
^^ '"fogn" )' They show that the obtained approximation bound is best possible 
unless the general Vertex Cover problem can be approximated within 2 — e for 
some e > 0. In what follows, we show how to modify the sampling procedure of 
the II-algorithm so that the degree condition can be slightly relaxed. Before that, 
we first outline their algorithm and state their main results. 

The Imamura-Iwama Algorithm. The algorithm is a refinement of the original 
Karpinski-Zelikovsky method, in which a large subset W of a minimum vertex 
cover is computed, then a 2-approximation is computed on the graph induced by 
the remaining uncovered edges. In general, it is sufficient to compute a subset W 
that has a large intersection with a minimum vertex cover. The following lemma 
bounds the approximation ratio obtained in this way as a function of the number 
of vertices of W that are contained in an optimal solution C. 

Lemma 2.1 (" IfLSlO . Let W be a subset of V. If there exists a minimum vertex cover 
C such that \W D C\ = n 1 and \W\C\ = n 2 , then we can construct in polynomial 
time a vertex cover C such that \C'\ < — n ^_„ 2 \C\. 

The II-algorithm uses both randomization and recursion to construct a suitable 
subset W . It is given in Figure [TJ, where: 




The initial call to the algorithm is made with i — 1 and a well-chosen value for t, 
as described below. 

Main Results. Let P be a path in the recursion tree of the II-algorithm. We 
denote by W P the set of removed vertices corresponding to this path, that is, W P :— 
U^Wf, where Wf is the set Vj chosen at the i-th step on the path P. 

Lemma 2.2. Let Cbea minimum vertex cover ofG. Then with high probability there 
is a path P in the recursion tree such that \ W P DC\ > (1 — o(l))|Wp|. 
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II-Algorithm A(t, i, G) 

Input: Graph G = (V, E), integers t and i, 

s := 2(logn) 2 
if i < t then 

\etH := {v E V : d(v) > r(G)} 

let U G be a set of s vertices from H chosen uniformly at random 

let V G := {H} U {N(v) : v G U G } 
for each Vj eV G , 1 < j < s + 1 do 

Cj := VjUA^i + l.G-Vj) 
end for 

return a minimum size set among {Cj}^} 
else (i = t) 

return a 2-approximation of the minimum vertex cover of G 
end if 



Figure 1 : Imamura-Iwama algorithm. 



A path satisfying the above condition is called a good path. 

To compute the approximation ratio obtained by the II-algorithm, we need to 
define the function 7(G). The function 7(G) is equal to - times the minimum size 
of an optimal vertex cover in a graph with the same number of vertices, average, 
and maximum degree as G. This function can be computed explicitly: 



7(G) 



d 

2A 



if |£|<A(n-A) 
A 'V / ("+ A ) 2 -^ if |£|>A(n-A) 



The following lemma indicates that the size of W can always match this lower 
bound, provided the depth of the recursion is sufficient. 

Lemma 2.3. For any path P of height t, \W P \ > (1 - (1 - ^Yh(G)n. 

We wish to compute a subset W of size arbitrarily close to 7(G)n. From the 
previous lemma, this requires the number of iterations to be t = 0(n/A). The size 
of the recursion tree is which is polynomial only if A = ^(nloglogn/logn). 

From Lemma 12.11 the corresponding approximation factor is then arbitrarily 
close to • In the range of A < |, the inequality \E\ < A(n — A) is always 

satisfied, since \E\ = 4f < 4f < A(n - A). Thus, we have 7(G) = d/(2A) and the 
result follows. 

Theorem 2.1. There is a randomized polynomial-time approximation algorithm 
with an approximation factor arbitrarily close to — ~- for the Vertex Cover prob- 

1+ 2A 

lem in graphs with average degree d and maximum degree A, such that A < | and 

A = fi(nloglogn/ logn). 
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A Modified Sampling Procedure. We now describe a modified version of the 
II-algorithm, in which the sample size s is reduced. We denote by C a minimum 
vertex cover, by Wi the set obtained at the i-th step in a good path, and by W the 

set W := UjWj. 

Lemma 2.4. |WinC| > (1 - |Wi| with probability at least 1 - 1/eA 

Proof. If a fraction at least 1 — l/s/s of H intersects C, then we are done. Other- 
wise, a random vertex v of H is such that v G C with probability at most 1 — 1/ \fs. 
Hence with probability 1 — (1 — l/y/s) s — > 1 — l/e^ (for sufficiently large values 
of n) we get a vertex u ^ C in our sample. The neighborhood N(v) of this vertex 
must therefore be completely contained in C. □ 

Multiplying the probabilities along the path yields the following: 

Lemma 2.5. With probability (1 - l/e^f, we have \W n C\ > (1 - l/y/s)\W\. 

We now analyze the behavior of a modified version of the II-algorithm, in which 
we set the two parameters s and t to the following values: 

s := (In In n — In In In Inn) 2 (1) 
t := aeA (2) 

First, we check that the probability of having an arbitrarily high proportion of 
W belonging to C is constant. From Lemma 12.51 this probability is 

(1 - = (1 - l/e^)^ = ((1 - l/x)T -> l/e a , 

where x = and we assume that n is sufficiently large. Thus, for these 
parameters, the algorithm computes a fraction 1 — o(l) of W is contained in an 
optimal vertex cover with a probability close to l/e a . 

Next, we show that the algorithm runs in polynomial time. The size of the 
search tree is (s + 1)' = (s + l) ae ^, hence we require: 

(s + l)^ = poly(n) (3) 
v/s + lnlns < lnlnn + 0(l). (4) 

For the chosen value of s, we obtain = . I n r , and J~s + In Ins < 

' In m Inn' V 1 — 

In Inn — In In In Inn + In In In Inn + 0(1) = In Inn + 0(1). So for this sample size, 
the running time of the algorithm remains polynomial. 

Finally, from Lemma 12.31 the desired approximation factor is obtained only 
when t — 9 (^). This yields: 

/ n \ ^ ( n \ ^ ( In In Inn 



t/ Vev^/ \ Inn 
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In general, the parameterized running time with ip{n) as parameter can be ex- 
pressed by the following: 

n O(l)( 5 + 1)* = n O(D( 5 + l)O(i) < ^(lnln^)) ® = n O(l) 2 W n)lnlnln(n)) 

Taking into account the above constraint, we also get a quasi-polynomial time 
algorithm in instances with A = Vl{n/ polylogn). 

Theorem 2.2. There is a randomized approximation algorithm with an approxima- 
tion factor arbitrarily close to 2/(1 + dj (2A)) for the Vertex Cover problem in graphs 
with average degree d, maximum degree A < n/2 and A = Q(n/ijj(n)) with running 
time n o(i) 2 o(4>(n)io g io g io g (n)) which i s polynomial if^(n) = O ( i og }°g" ogn ). 

3 The Subdense Connected Vertex Cover Problem 

In this section, we study the approximability of nondense instances of the Con- 
nected Vertex Cover (CVC) problem. Furthermore, we obtain an approximation 
algorithm for a generalization of the CVC problem. 

Cardinal and Levy [8] studied connected vertex covers in everywhere and av- 
erage dense graphs. For both cases, they gave approximation algorithms with 
asymptotically optimal approximation ratios assuming the Unique Game Conjec- 
ture. In the same work, they conjectured that the Connected Vertex Cover problem 
restricted to even less dense graphs can be approximated with an approximation 
ratio smaller than 2 in polynomial time and left it for future work. 

We introduce the Subset Connected Vertex Cover (SCVC) problem which gen- 
eralizes the CVC problem and present a constant-factor approximation algorithm. 
By combining the approximation algorithm for the SCVC problem with the modi- 
fied II-algorithm, we prove the existence of a randomized approximation algorithm 
with approximation ratio less than 2 for the CVC problem in a restricted class of 
subdense graphs running in polynomial time. If we allow quasi-polynomial run- 
ning time, we obtain the same approximation ratio for a larger class of subdense 
instance. 

3.1 The Subset Connected Vertex Cover Problem 

Definition 3.1. Subset Connected Vertex Cover problem (SCVC) 
Given a connected graph G = (V, E) and a set S C V, a feasible solution consists of 
a connected vertex cover Y of G such that S C Y, The task is to find such a Y with 
minimum cardinality. 

The maximal matching heuristic is a simple 2-approximation algorithm for the 
Vertex Cover problem. It consists of choosing all vertices in any maximal matching. 
In order to obtain a 2-approximate solution for the CVC problem, we have to 
extend this concept: For a given connected graph G — (V, E), a connected maximal 
matching (CMM) is a maximal matching M in G such that G[V(M)] is connected. 
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Clearly, V(M) is a 2-approximate solution for the CVC problem and M can be 
computed in polynomial time. 

As a subroutine in our approximation algorithm for the SCVC problem, we will 
use an approximation algorithm for the Steiner Tree problem for special instances 
called quasi-bipartite: An instance of the Steiner Tree problem G = (V, E) and 
terminal set S is called quasi-bipartite, \ZV\S forms an independent set in G. Gropl 
et al. ITT5T1 gave an 1.217-approximation algorithm for quasi-bipartite instances, 
which we will note as A q bsr in the following. For simplicity, we will note the 
approximation ratio of A q bsr as r qbS T- 

Our algorithms works in two phases. The first phase computes a connected 
maximal matching in every connected component of G[y\5']. The second phase 
connects the remaining connected components by using the algorithm A q bsr on 
quasi-bipartite instances of the Steiner Tree problem. 
We formulate now an algorithm Ascvc f° r the SCVC problem. 

Algorithm Ascvc 

Input: G = (V, E) and S C V 

© Si := S and S 2 := 
while E(G[V\Si)) ^ 

compute a CMM M c in G[V\Si) 

starting with a vertex c e V\Si which is connected to a s e S 
S l :=V(M c )US 1 
endwhile 

© contract every connected component C of G[Si] into a vertex sc 

S2 '■= {sc I C is a connected component of GfSi]} 

Let G' be the graph after vertex contraction in G 
© compute a Steiner Tree T for S 2 in G' using Aqbsr 
Return Si U (V(T)\S 2 ) 



Figure 2: Algorithm Ascvc- 

We are ready to prove the following 
Theorem 3.1. Given a connected graph G = (V,E) and S c V, the algorithm 
Ascvc (see Figure |2P an approximation ratio at most max < r qb sT, — isf 

Proof. Let OPT be an optimal connected vertex cover of G such that S C OPT. 
Furthermore, let S 1 / be the set 5i at the beginning of phase © and C a be the 
set of connected components of G[S(]. Since there exists for every connected 
component C G C a a s c G 5 R C, we know that |C D OPT| > 1. Furthermore, 
it implies that the cardinality of C a is at most \S\. Hence, we have to connect 
\S 2 \ < \S\ terminals in the graph G'. Since we computed a connected maximal 
matching in every connected component of C?[V\£], the remaining vertices 
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V\S{ form an independent set in G and therefore, they must form an indepen- 
dent set in G". Thus, we can use an approximation algorithm for the Steiner 
Tree problem restricted to quasi-bipartite instances to connect all the vertices in S 2 . 



In order to analyze the cost of the new vertices introduced in the phase (D, 
we will use an auxiliary graph construction: For every C e C a , we contract the set 
CnOPT into s' c in G. Let G pt be the resulting graph and S' OPT = {s' c \ C e C a }. 
Since we have |C| > \C n OPT\ > 1 for every C e C a , the cost of a minimum 
Steiner Tree T OPT for S' OPT in G pt cannot be lower than the cost of a minimum 
Steiner Tree V for S 2 in G'. We want to lower bound \OPT\ by dividing the 
total cost into the cost per component in C a and the cost to connect the contracted 
components. Hence, we get \OPT\ > Ecec a \CnOPT\ + \V(T PT)\S OPT \. Clearly, 
Y^caCa \C n OPT\ can be lower bounded by | *S'| and the number of edges of the 
computed connected maximal matchings. Finally, we introduce I — V(T')\S 2 . 

We are ready now to analyze the approximation ratio R of the algorithm 
•Ascvc'- 



R < 



< 



\S{\ + \V(T)\S 2 



< 



< 



< 



< 



< 



S[\S\/2 + \S\ + \V(T pt)\S' opt \ 
\S[\ + {cost(T) + l-\S 2 \} 



S{\S\/2 + \S\ + {cost{To PT ) + 1 - \S' OPT \} 
\S(\S\ + \S\ +cost(T) + 1-1521 



S[\S\/2 + \S\ + cost(T) + 1 - \S 2 \ 

S{\S\ + \S\+ r qbST (\S 2 \ + |/| - 1) + 1 - \S 2 



(*) 



\S[\S\/2 + \S\ + (\S 2 \ + |/| - 1) + 1 - \S 2 \ 
S{\S\ + \S\ + (r qbST - l)(\S 2 \ - 1) + r qbST \I\ 




\S[\S\/2 + \S\ + |/| 
S[\S\+r qbST (\S\ + \I\ 



\S[\S\/2 + \S\ + |/| 



=: F 



In (*), we used the facts that cost(T OPT ) > cost(T) and \S 2 \ = \S' C 



OPT\ 



We have to analyze the worst case behavior of the fraction F. Clearly, F cannot 
be smaller than r qbS T- On the other hand, since is the term with the largest 

coefficient in F, we can maximize F by maximizing |S'f\S'|. Therefore, we set 
\OPT\ = \S\ + \S{\S\/2 and \V\ = \S\ + \S[\S\. In due consideration of the 
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constraints, we derive 



l-si + \s(\s\ } 

F < max < r gbS T, ? } = max < r gb sT, ; 

1 9 \S\ + \S{\S\/2j I ' 



\S\/2+\sf\S\/2 



2 

max <^ r qbST , ^ 

1 + W\ 



□ 



3.2 Application to the Connected Vertex Cover Problem 

By combining the modified II-algorithm with Ascvc, we obtain the following ap- 
proximation result. 

Theorem 3.2. There is a randomized approximation algorithm with an approxima- 
tion factor arbitrarily close to 2/(1 + d/ (2 A)) for the Vertex Cover problem in graphs 
with average degree d, maximum degree A < n/2 and A = Vl{n/%l){n)) with running 

time n O{l)20^{n)\ 0? ,\og\og{n)) which ( s 

.polynomial if ^(„) = o 

• quasi-polynomial ifip(n) = polylogn. 

Proof. It is not hard to see that the original II-algorithm can generate with high 
probability a part W of an optimal connected vertex cover, since a connected ver- 
tex cover is a vertex cover with additional properties and the II-algorithm only 
makes use of the necessary requirements of a vertex cover. Since the modified II- 
algorithm takes advantage of the same requirements, we can invoke the modified 
II-algorithm to generate with high probability a part W of an optimal connected 
vertex cover with cardinality \W\ > (cf. Theorem I2.2D . 

Then, we apply the algorithm Ascvc to G and S = W to obtain a connected 
vertex cover of G. For the considered range of values for A, this combined algo- 
rithm has an approximation ratio at most max < r qbS T, — t~ \ ■ But the latter ratio 

I 1+ 2A J 

is always at least 4/3, and thus is always greater than r qbST < 1.217 (cf. 11T51D . □ 



4 The Subdense Set Cover Problem 

Recall that a set cover instance consists of a ground set X of size n and a collection 
P c 2 X of subsets of X, with \P\ = m. We wish to find a minimum subset 
M C P such that UseM S = X. We consider subdense instances of the Set Cover 
problem, generalizing the dense version of the Set Cover problem proposed by 
Karpinski and Zelikovsky [22J. In the e-dense Set Cover problem, every element 
in X belongs to at least a fraction e of the subsets in P. We call an instance of 
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the Set Cover problem ip{n) -dense whenever every element is contained in at least 
m/ijj(n) subsets. 

It is well-known that the greedy algorithm provides a (1 + Inn) -approximate 
solution to the general Set Cover problem, and that a logarithmic approximation 
is essentially the best achievable, under classical complexity-theoretic assump- 
tions IfTBl . The greedy algorithm picks at every iteration the subset that covers 
the largest number of remaining (uncovered) elements. We now consider the be- 
havior of this algorithm for the ip{n) -dense instances of set cover. 

Lemma 4.1. The greedy algorithm for tp(n)-dense instances of the Set Cover problem 
returns a solution of size at most ip(n) Inn. 

Proof. At the first iteration, every element in X is contained in at least m/ip(n) 
subsets. Hence, by the pigeonhole principle, there exists a subset covering at 
least n/ip(n) elements of X. After the i-th iteration of the algorithm, there are 
exactly m — i subsets remaining, but every uncovered element is still contained 
in at least m/ip(n) > (m — i)/ip(n) remaining subsets, as otherwise it would have 
been covered in a previous step. Hence, there remains a subset covering at least 
a fraction l/ip(n) of the uncovered elements. Thus, we reduce the number of 
uncovered elements by a factor 1 — at every iteration. 
The number of iterations required is therefore at most 



Inn 



n -1 



hi 



1 



We assume that ip(ri) > 2 and proceed by deriving an upper bound on the second 
term: 

n -1 



hi 



1 



tp(n) 



hi 



ip(n) — I J 
1 



ln(-?/>(n)) — ln(-?/>(n) — 1) 
"■00) i x ' 



' ip(n)— 1 % 



-dx 



< f (V(n) - (^(n) - 1)) 



ip(n) 



which proves the lemma. 



□ 



From this bound, we know that we can solve the problem exactly by performing 
an exhaustive search in time O ( m lnn '^( n )) . For sufficiently small functions ip(n), 
this is subexponential. Hence, we obtain that unless NP C DTIME[n logn ^}, the 
■0(n)-dense Set Cover problem is not NP-complete. This generalizes the analogous 
observation for the dense Set Cover problem previously made by Karpinski and 
Zelikovsky Il22ll . 
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5 The Subdense Steiner Tree Problem 



In this section, we study the approximability of nondense instances of the Steiner 
Tree problem. In particular, we show that 0(ip(n)) -dense instances of the Steiner 
Tree problem can be approximated to within 1 + 5 in time n°^2°^ ± ^ 1 ' ) . On the 
other hand, we show that for every 5, e > 0, the Steiner Tree problem restricted to 
| K\S* | ^-everywhere dense graphs is NP-hard to approximate with ||| — e. 

Karpinski and Zelikovsky 11221 introduced the e-dense Steiner Tree problem 
where an instance is called e-dense if each terminal t e S has at least e| "KXS'I 
neighbors in V\S. They showed that for every e > the e-dense Steiner Tree prob- 
lem admits a PTAS. On a high level, the scheme finds in every step a non-terminal 
which is adjacent to a large fraction of the actual terminal set, thus reducing 
the cardinality of the terminal set to a constant after iterating this process. The 
remaining instance can be solved by an exact algorithm in polynomial time. 
Hauptmann IfToTl showed that the same scheme actually yields even an EPTAS for 
the e-dense Steiner Tree problem. 

An instance of the Steiner Tree problem on graphs consisting of G = (V, E) and 
terminal set S is called ip(n) -dense if each terminal has at least neighbors in 

\v\s\. 

We show that the Karpinski-Zelikovsky-techniques can be extended to subdense 
cases and prove the following 

Theorem 5.1. tp(n)-dense instances of the Steiner Tree problem can be approximated 
to within 1 + 5 in time n°^2 0{± Tr\ 

By setting ip(n) to O(logn), we obtain the existence of PTAS for subdense in- 
stances of the Steiner Tree problem where every terminal has at least O(^f-) 
neighbors in V\S. 

Corollary 5.1. There exists a PTAS for 0(\ogn)-dense instances of the Steiner Tree 
problem. 

By setting ip(n) to polylog(n), it yields quasi-polynomial time approximation 
schemes. 

Corollary 5.2. There exist quasi-polynomial time approximation schemes (QPTAS) 
for po\y\og(n)-dense instances of the Steiner Tree problem. 



Analysis of the Algorithm Mdstp 

First of all, we will prove a lemma of significant importance to our analysis and 
proof of our main result. It shows that we can extract a large portion of the actual 
terminal set S\ . 

Lemma 5.1. In every iteration of the phase ©, the cardinality of the extracted neigh- 
borhood contained in Si can be lower bounded by \N(v) D Si\ > 44rK. 
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Algorithm Mdstp 

Input: ^(ra) -dense instance G = (V, E), S C V and 5 > 

® S ± := 5 

while IS'il > s(5,ip) :— max{|,2} • ^(n) 

v := argmax{|N(w) n S^u G 7\(5UiV)} 

S(v) := be the star consisting of v and JV(u) n Si 

N := N U {v} and contract S(v) into s„ 

S x := (S 1 \N(v)) U 
endwhile 

© Apply Dreyfus-Wagner algorithm to remaining instance to obtain T D & W 

Return T 5 := T D&lW U \J v£N S(v) 



Figure 3: Modified DSTP algorithm. 

Proof. We define S{ as the set S'i after the i-th iteration of phase ©, S(vi) be the 
star picked in the i-th iteration and be the set of non-terminals after the i-th 
iteration. In addition, we introduce a data structure called contraction tree, which 
will be one of the main tools in our proof. 

Definition 5.1 (Contraction Tree). Given a terminal s Vj e S\\S with i > 0, we 
define recursively the contraction tree T(s Vj ). The root ofT(s Vj ) consists of the node 
s Vj and the child nodes of s Vj are defined as c e S{ where j < % and S{ is the set that 
was contracted into s Vj . All nodes ofT(s Vj ) contained in S are leaf nodes. The subtree 
ofT(s Vj ) atce S{\S is given by T(c). 

First of all, we prove a simple fact concerning the number of non-terminal 
neighbors of nodes in S[\S. 

Fact 5.1. Given a node s Vj e S\\S, the number of non-terminal neighbors of s Vj can 
be lower bounded by ^^j- - height(T(s Vj )). 

Proof. Let s Vk e V(T(s Vj ))\S be a inner node of T(s Vj ) and be the set which was 
contracted into s Vk . Let s' e be the node with d non-terminal neighbors where 
d = min seS .fe{#non-terminal neighbors of s}. Since every contraction involves only 
one non-terminal, the number of non-terminal neighbors of the parent node s Vk of 
s' is at least d — 1 . Therefore, we can conclude that the root s Vj must have at least 
J^yi - height(T(s Vj )) non-terminal neighbors. □ 

Clearly, the remaining non-contracted terminals still have at least neigh- 
bors. Therefore, the number of edges between S{ and N % denoted as \E(S{, N l )\ 
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can be lower bounded by 

\e(sin*)\ > (\s[ns\)¥^+ M*)n(JV*)| 

^ ' teSi\S 

> (|* lM s}WgJ + £ M.^crw) 

> |^|J^l- \{ Sl | Sj is a inner node of T(t)} | 

Since the inner nodes of the contraction tree corresponds to contractions and 
therefore, corresponds to iterations, we get 



\{si | s t is a inner node of T(t)}\ < i and ^(Sj, JV*)| > \S{ 

tes\\s 



\v\s\ . 



After iteration i of the phase ©, we still have | iV*| = \V\S\ — i non-terminals. We 
can assume that \N % \ > since otherwise we are done. Recall that > 2ip(n). By 
pigeonhole principle, we know there must be av e N l with the following number 
of neighbors in S\ : 

icvoi AriM \Qi\\V\S\ ■ \gi\\V\S\ -\S{\ 

\E(S{,N l )\ > gil » > gil % w > \S{\ 



\N*\ ~ | — i - IVXS'l-i _ ip(n) 

Thus, we can pick in every iteration at least terminals. 

But since we contract a star S(v) into s v and add s v to Si, we decreased Si 
only by — 1 . Alternatively, this can be seen as a greedy pick with a slightly 
worse density c • ip(n). In other words, we want to show that there is a c > 1 such 
that — 1 > > . After short calculation, we get \Si\(c— 1) > c- ip(n). Since 
the cardinality of Si is at least 2 • ip(n), we obtain ip{n) > -z^i>{n), which can be 
ensured by c := 2. By combining these two arguments, we conclude that we can 
reduce the set S\ by ^yjy in every iteration. □ 

The next lemma deals with the upper bound on the cardinality of the set N at 
the beginning of phase ©. 



Lemma 5.2. The cardinality of the set N at the end of phase © can he upper hounded 

Proof. According to Lemma 15.11 , we can pick in every iteration at least ter- 
minals. If we define S\ as the set Si after the i-th. iteration of the phase ©, we 
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see that \S{\ < (l — 2^)) \&\- m OI "der to decrease \S{\ to s(5, ip(n)), we have 
to iterate the first phase at least In (- 



s(5,ip(n)) 

lemma 14. 1L we obtain the following inequality 



times. By applying 



|7V| = In 



\S\ 



In 



1 



-1 



□ 

We are ready to prove our main Theorem 15. 11 

Proof. Let T pt be an optimal Steiner Tree for S in G and T 2 be the optimal steiner 
tree for Si computed in the phase ©. For every v e N, we add a set E(T V ) of edges 
to E(G) that form a spanning tree T v for S(y)\{v}. Let G' := (V(G),£(G) U 
\J veN E(T V ) and Tq PT be an optimal steiner tree for S in G'. Clearly, we have 
c(T pt) > c(Tq PT ) since adding edges to a graph cannot increase the cost of an 
optimal steiner tree. By definition of T D & W , we obtain c(T' OPT ) = c(T D & w ) + 
^2 V £N C (^f )• Hence, we can relate the cost of T & with the cost of T' OPT in order to 
analyze the approximation ratio R of the algorithm: 

R < c ^ < E. 6W (|s(«)i-i) + cW 



< 1 + 



c(^ PT ) - £„ eiV (|^)\MI - i) + c(T D&w ) 
|iV| 



E^(l^)\WI-i) + c(T D&w ) 



\n\ ln (i^ffey) 2 • ln (i(i^j) 5 

< 1 + - — -<H v J <1H v y < 1 + (5 

The running time of the Dreyfus-Wagner algorithm IfTTTl is 0(3^ s ^n + 2^ s ^n 2 + 
n 3 ). Hence, the running time of the algorithm Mdstp is dominated by 

n o(i) 2 o( s (w(n))) =n o(i) 2 o(*!i) i n 

5 . 1 Inapproximability Results 

We prove now the following approximation hardness result for the Steiner Tree 
problem restricted to graphs where the degree of every node is at least | V\S'| 1 ~ <5 . 



1-5 



Theorem 5.2. For every 5, e > 0, the Steiner Tree problem restricted to \V\S\ 
everywhere dense graphs is APX-hard and NP-hard to approximate with ||| — e. 

Proof. Berman and Karpinski [4] obtained explicit inapproximability results for 
the Vertex Cover problem in graphs with bounded maximum degree. We will 
combine this explicit lower bound with an approximation preserving reduction 
from the Vertex Cover problem restricted to bounded degree graphs to the Steiner 
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Tree problem. In particular, we will create a special instance of the Steiner Tree 
problem where the degree of every terminal is exactly 2. It will be the starting 
point of our densification process to obtain explicit hardness results on graphs 
with high degree. 

First, we describe the approximation preserving reduction due to Bern and 
Plassman ^ : Given a graph G = (V, E) with maximum degree B as an instance 
of the Vertex Cover problem, we construct the graph Gst = (Vst, E S t) and the 
terminal set S S t ^ Vst- 

For every edge e e E, we introduce a vertex v e and join it with the two vertices 
w, x e e in G S t- Hence, we get Vst = V U {v e \ e 6 E}. Finally, we connect 
all x, y G V in Gst and define Sst '■= {v e \ e <E E}. Clearly, the degree of every 
terminal is 2 and the length of an optimal Steiner Tree is \E\ + \VCopt\ — 1 < 
§ \V\ + \VC pt\ — 1 where VC pt denotes a minimum vertex cover in G. We are 
ready to deduce the explicit lower bound for the special instance of the Steiner 
Tree problem. 

The following hardness result for bounded degree version of the Vertex Cover 
problem is due to Berman and Karpinski: 

Theorem 5.3 (Berman and Karpinski [4]). The Vertex Cover problem is NP-hard to 
approximate with || — e in graphs G with maximum degree A G = 4. 

More precisely, they proved that for every e e (0, 1/2), it is NP-hard to decide 
whether an instance of the Vertex Cover problem restricted to graphs with 
maximum degree A G = 4 with 104n nodes has a vertex cover of size below 
(54 + e)n or above (55 — e)n. 

Combining these results with the Bern-Plassman reduction, we obtain the fol- 
lowing: The Berman-Karpinski graph for the Vertex Cover problem restricted to 
graphs with maximum degree A G = 4 has at most 4 • WAn/2 = 208n edges. There- 
fore, it is is NP-hard to decide whether smt < 208n + (54+e)n — 1 = (262+e)n — 1 or 
smt > 208n + (55 — e)n — 1 = (263 — e)n — 1. Hence, the described special instance 
of the Steiner Tree problem is NP-hard to approximate with approximation ratio 

263 

262 fc " 

Now, we need to increase the degree of every terminal. In order to obtain an in- 
stance with high vertex degree, we associate with every non-terminal n e Vst\Sst 
a set U n of k new vertices and introduce U := {J n ev ST \s S T ^ e c °nstruct the 
graph D = (V(D), E(D)) with V(D) = U U S ST and e\d) T -.= {{s, u}\ueU x ,se 
Sst, {s,x} e E ST } U (T). The terminal set of this instance remains the same 
Sd '■— Sst- To determine the vertex degree of a terminal, we define m := \ Vst\Sd\- 
Furthermore, we choose k := m~ . Notice that the degree of every s e S is at 
least 2k = 2m^ = 2(m^ ■ m) l - & = 2|f/| 1 "' 5 = 2\V(D)\S D \ 1 - S . Next, we analyze 
how the optimal solutions for (Gst, Sst) and (D, Sd) can be related to each other. 
Let OPTst [OPT D ] be an optimal solution for (Gst, S S t) l(D, S D )]. Since it does 
not improve the cost of a feasible solution for (D, S D ) to have more than one ver- 
tex of every U x with x e V S t\Sst, we have to consider only feasible solutions 
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T = (Vt, Et) with \U X D Vt\ < 1 for every x e Vst\Sst- This implies a canonical 
and cost preserving reduction. In particular, we have \OPT S t\ = \OPT D \. All in 
all, we obtain for every e > the same inapproximability factor ||| — e for instances 
where every vertex has degree at least |l / (.D)\S' D | 1 ~ 5 . □ 

6 Summary of Results 

Tables [3]-[5] give now a summary of our main results. 



Problem 


Upper Bound 


Approx. Hardness 


VC 


2 

l+d/(2A) 


TT^A) " S CUGC) ffH 


cvc 


2 

l+d/(2A) 


TT^A) - S CUGQ mi 


STP 


PTAS 





Table 3: Table of subdense results 



Problem 


Upper Bound 


Approx. Hardness 


Run Time 


CVC 


2 

l+d/(2A) 


tt^a) - & cugc) ma 


quasipoly time 


STP 


PTAS 


TT^A) - * (UGC) ffH 


quasipoly time 



Table 4: Table of mildly sparse approximability results 



Problem 


Approx. Ratio 


Run Time 


VC 


2 

l+d/(2A) 


„0(l)oO(^(n) In In Inn) 


CVC 


2 

l+d/(2A) 


n O(l)20(^)(n) In In Inn) 


STP 


1 + 5 


n o(i) 2 o(^) 


SC 


exact 





Table 5: Table of known tradeoffs 
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7 Conclusion 



We have established several approximability results for the subdense instances 
of Covering and Steiner Tree problems, proving in some cases optimality of the 
approximation ratios under standard complexity- theoretic assumptions. A very 
interesting open problem still remains the status of the dense (and of course also 
of the subdense) Steiner Forest problem. Our methods do not apply to this problem 
directly, and some new techniques seem to be necessary. 
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